A Supervised Discretization Method for Quantitative and Qualitative Ordered Variables

نویسندگان

  • Francisco Javier Ruiz
  • Cecilio Angulo
  • Núria Agell
چکیده

In this work, a new technique to define cut-points in the discretization process of a continuous attribute is presented. This method is used as a prior step in a regression problem, considered as a learning problem in which the output variable can be either quantitative (continuous or discreet) or qualitative defined over an ordinal scale. The proposed method emphasizes the concept of location to determine discretization cut-points. In the case of continuous outputs, the method is based on the maximization of the difference between distributions by using intervalar distances. In the case of qualitative outputs, a qualitative distance is defined over a structure of absolute orders of magnitude. The main characteristics of the method presented are illustrated through three examples, two for continuous outputs and the last for a qualitative output.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discretizing Continuous Attributes While Learning Bayesian Networks

We introduce a method for learning Bayesian networks that handles the discretization of continuous variables as an integral part of the learning process. The main ingredient in this method is a new metric based on the Minimal Description Length principle for choosing the threshold values for the discretization while learning the Bayesian network structure. This score balances the complexity of ...

متن کامل

A New Hybrid Framework for Filter based Feature Selection using Information Gain and Symmetric Uncertainty (TECHNICAL NOTE)

Feature selection is a pre-processing technique used for eliminating the irrelevant and redundant features which results in enhancing the performance of the classifiers. When a dataset contains more irrelevant and redundant features, it fails to increase the accuracy and also reduces the performance of the classifiers. To avoid them, this paper presents a new hybrid feature selection method usi...

متن کامل

Systematic Review of Quantitative and Qualitative Research on Divorce Factors

Background: Divorce has always been one of the five main issues in the country and one of the criteria for community health. Therefore the purpose of this study is to investigate quantitative and qualitative researches on the factors affecting divorce through systematic review. Method: The research community includes all the quantitative and qualitative articles published (2006-2016) regarding...

متن کامل

Landscape assessment of high-rise buildings: A method based on 3DGIS, BIM and AHP

In this paper, we propose a quantitative indicator for measuring and ranking real estate from the perspective of the surrounding landscape by integrating the building information model (BIM) and the three-dimensional geospatial information system (3DGIS) based on the AHP method. The landscape is one of the qualitative variables which it's measuring and quantifying is a complex task. In previous...

متن کامل

Optimal Bayesian 2D-Discretization for Variable Ranking in Regression

In supervised machine learning, variable ranking aims at sorting the input variables according to their relevance w.r.t. an output variable. In this paper, we propose a new relevance criterion for variable ranking in a regression problem with a large number of variables. This criterion comes from a discretization of both input and output variables, derived as an extension of a Bayesian non para...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computación y Sistemas

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2006